super-resolution generative adversarial network
Super-Resolution Generative Adversarial Networks based Video Enhancement
Çetin, Kağan, Akça, Hacer, Gerek, Ömer Nezih
This study introduces an enhanced approach to video super-resolution by extending ordinary Single-Image Super-Resolution (SISR) Super-Resolution Generative Adversarial Network (SRGAN) structure to handle spatio-temporal data. While SRGAN has proven effective for single-image enhancement, its design does not account for the temporal continuity required in video processing. To address this, a modified framework that incorporates 3D Non-Local Blocks is proposed, which is enabling the model to capture relationships across both spatial and temporal dimensions. An experimental training pipeline is developed, based on patch-wise learning and advanced data degradation techniques, to simulate real-world video conditions and learn from both local and global structures and details. This helps the model generalize better and maintain stability across varying video content while maintaining the general structure besides the pixel-wise correctness. Two model variants--one larger and one more lightweight--are presented to explore the trade-offs between performance and efficiency. The results demonstrate improved temporal coherence, sharper textures, and fewer visual artifacts compared to traditional single-image methods. This work contributes to the development of practical, learning-based solutions for video enhancement tasks, with potential applications in streaming, gaming, and digital restoration.
- Research Report > New Finding (0.66)
- Research Report > Promising Solution (0.46)
- Health & Medicine (0.46)
- Information Technology (0.46)
- Leisure & Entertainment (0.46)
Generative models in a nutshell. Have you ever wished you could create…
Have you ever wished you could create your own realistic images, translate text into multiple languages, or even compose original music? If so, you'll be interested to learn about generative models -- a type of machine learning algorithm that has the power to bring these capabilities to life. Generative models are trained on a dataset and learn the underlying distribution of the data. This allows them to generate new, synthetic examples that are similar to the original dataset. But what makes generative models so special?
Applying Physics-Informed Enhanced Super-Resolution Generative Adversarial Networks to Finite-Rate-Chemistry Flows and Predicting Lean Premixed Gas Turbine Combustors
The accurate prediction of small scales in underresolved flows is still one of the main challenges in predictive simulations of complex configurations. Over the last few years, data-driven modeling has become popular in many fields as large, often extensively labeled datasets are now available and training of large neural networks has become possible on graphics processing units (GPUs) that speed up the learning process tremendously. In fact, the successful application of deep neural networks in fluid dynamics, such as for underresolved reactive flows, is still challenging. This work advances the recently introduced PIESRGAN to reactive finite-rate-chemistry flows. However, since combustion chemistry typically acts on the smallest scales, the original approach needs to be extended. Therefore, the modeling approach of PIESRGAN is modified to accurately account for the challenges in the context of laminar finite-rate-chemistry flows. The modified PIESRGAN-based model gives good agreement in a priori and a posteriori tests in a laminar lean premixed combustion setup. Furthermore, a reduced PIESRGAN-based model is presented that solves only the major species on a reconstructed field and employs PIERSGAN lookup for the remaining species, utilizing staggering in time. The advantages of the discriminator-supported training are shown, and the usability of the new model demonstrated in the context of a model gas turbine combustor.
AI-Assisted Terahertz Imaging for Acute Lung Injury Therapy
In a study published in Biosensors, a label-free and rapid identification technique based on artificial intelligence (AI)-assisted terahertz imaging is proposed to identify the therapeutic effect of low-level laser therapy for acute lung injury. Acute lung injury is a severe respiratory failure characterized by hypoxemia and the development of bilateral infiltrates on chest radiographs and is sometimes referred to as acute respiratory distress syndrome. Acute lung injury continues to substantially cause high mortality and morbidity in critically sick patients. Due to the ineffectiveness of current therapies for acute lung injury, it is critical to develop an effective therapeutic approach to significantly lower mortality and enhance the life expectancy of patients. Low-level laser therapy holds significant potential as a noninvasive therapeutic technique in biomedicine.
- Health & Medicine > Therapeutic Area (0.78)
- Health & Medicine > Nuclear Medicine (0.71)
- Health & Medicine > Diagnostic Medicine > Imaging (0.71)
The First Principles of Deep Learning and Compression
The deep learning revolution incited by the 2012 Alexnet paper has been transformative for the field of computer vision. Many problems which were severely limited using classical solutions are now seeing unprecedented success. The rapid proliferation of deep learning methods has led to a sharp increase in their use in consumer and embedded applications. One consequence of consumer and embedded applications is lossy multimedia compression which is required to engineer the efficient storage and transmission of data in these real-world scenarios. As such, there has been increased interest in a deep learning solution for multimedia compression which would allow for higher compression ratios and increased visual quality. The deep learning approach to multimedia compression, so called Learned Multimedia Compression, involves computing a compressed representation of an image or video using a deep network for the encoder and the decoder. While these techniques have enjoyed impressive academic success, their industry adoption has been essentially non-existent. Classical compression techniques like JPEG and MPEG are too entrenched in modern computing to be easily replaced. This dissertation takes an orthogonal approach and leverages deep learning to improve the compression fidelity of these classical algorithms. This allows the incredible advances in deep learning to be used for multimedia compression without threatening the ubiquity of the classical methods. The key insight of this work is that methods which are motivated by first principles, i.e., the underlying engineering decisions that were made when the compression algorithms were developed, are more effective than general methods. By encoding prior knowledge into the design of the algorithm, the flexibility, performance, and/or accuracy are improved at the cost of generality...
- Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- North America > United States > Maryland > Prince George's County > College Park (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- (4 more...)
- Research Report > New Finding (1.00)
- Overview (1.00)
- Summary/Review (0.87)
- Leisure & Entertainment (1.00)
- Information Technology (1.00)
- Education (1.00)
- (2 more...)
Using Physics-Informed Super-Resolution Generative Adversarial Networks for Subgrid Modeling in Turbulent Reactive Flows
Bode, Mathis, Gauding, Michael, Lian, Zeyu, Denker, Dominik, Davidovic, Marco, Kleinheinz, Konstantin, Jitsev, Jenia, Pitsch, Heinz
Turbulence is still one of the main challenges for accurately predicting reactive flows. Therefore, the development of new turbulence closures which can be applied to combustion problems is essential. Data-driven modeling has become very popular in many fields over the last years as large, often extensively labeled, datasets became available and training of large neural networks became possible on GPUs speeding up the learning process tremendously. However, the successful application of deep neural networks in fluid dynamics, for example for subgrid modeling in the context of large-eddy simulations (LESs), is still challenging. Reasons for this are the large amount of degrees of freedom in realistic flows, the high requirements with respect to accuracy and error robustness, as well as open questions, such as the generalization capability of trained neural networks in such high-dimensional, physics-constrained scenarios. This work presents a novel subgrid modeling approach based on a generative adversarial network (GAN), which is trained with unsupervised deep learning (DL) using adversarial and physics-informed losses. A two-step training method is used to improve the generalization capability, especially extrapolation, of the network. The novel approach gives good results in a priori as well as a posteriori tests with decaying turbulence including turbulent mixing. The applicability of the network in complex combustion scenarios is furthermore discussed by employing it to a reactive LES of the Spray A case defined by the Engine Combustion Network (ECN).